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Abstract 

We propose a parallel introduction to Galilean and Einsteinian relativity based on the causal 
structure and inertial motions. Galilean and Poincare transformations, as objects secondary to the 
geometrical structure, are left aside. 
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I. INTRODUCTION 



This article is intended for university level teachers lecturing, and students learning, 
special relativity (SR). It is not meant as a text which could be directly used as a SR primer. 
Rather it gives a background, or an outline on which one can elaborate the exposition of SR. 
We assume as a background for this article a course in linear algebra including real vector 
and aflfine spaces, direct sums of subspaces, linear forms and symmetric bihnear forms of 
any signature. 

We propose a highly structured and logical approach to the fundamentals of SR based 
on its causal structure and relativity of inertial motions. For comparison and better under- 
standing we parallelly build the Galilean spacetime (GS) on similar ideas. We indicate that 
the causal structure determines the metric structure of SR spacetime uniquely, which is not 
the case for the choice of Euchdean metric in the Galilean case. 

We want to stress the point that the Galilean and Lorentz (Poincare) transformations are 
objects secondary to the geometric structure of spacetime: they are aflfine mappings leaving 
this structure invariant. We regard basing the introduction to SR on these transformations 
as a serious misconception and we do not discuss them in this article. 

We are also of the opinion that introducing SR, for the sake of alleged simphcity, from the 
three-dimensional rather than full geometrical point of view, in fact makes understanding of 
SR more difficult, and can easily lead to misconceptions. We regard as especially harmful 
figures illustrating hypothetical relative motion of frames as depicted in Fig. 1. Whereas 
this is not the best, but correct picture in GS, it is completely wrong in SR. The reason for 
that is that the hyperplanes of constant time ('pure space') of observers in relative motion 
are not parallel, so they cannot be regarded as 'sliding' on each other. 

Elements of the programme sketched above appeared, of course, in many earlier publica- 
tions and books (see e.g. Refs. 1-3) but we believe that our scheme adds some value to the 
clarity and logic. 

In addition we discuss some simple geometric effects in the present context. This will 
include a discussion of the view of the celestial sphere as seen by different observers.!^ This 
point is particularly worth adding, as it is usually treated with the help of a rather indirect 
method of stereographic projection.^^ We discuss it directly on the celestial spheres of two 
observers. 
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In all discussions of effects involving different observers we consistently avoid, as men- 
tioned above, the use of Galilean or Lorentz transformations. To relate the views on the 
spacetime as seen by two inertial observers one needs only to know the directional vectors 
of their world-lines. On the other hand one needs complete bases attached to the observers 
to specify a transformation between them. 

II. HOMOGENEITY WITH RESPECT TO TRANSLATIONS AND THE AFFINE 
STRUCTURE 

It is fairly obvious from everyday experience that one needs four real numbers to place an 
event in space and time. For a given event the specific values of these numbers depend on 
an adopted system of labels, but they always form an element of the set M^. Our spacetime 
is a structure based on this set. 

Another common experience points to the applicability of spacetime translations: if 
a physical occurrence takes place in a given region of space and within some time-span, 
an analogous occurrence may take place elsewhere and at another time. We include this 
property in our construction of a model of the spacetime in the following form: the group 
of four-dimensional translations acts transitively on the spacetime. This leads us to the 
following starting point for the construction of a spacetime model: 

Flat spacetime is modeled by a real four-dimensional affine space {M.,M). 

Here M. denotes the affine space based on the four-dimensional vector space M. We adopt 
the notation P,Q, . . . for points in M and x,y, . . . for vectors in M. We write x — PQ if 
Q — P + X. Moreover, if P e and N C M is any subset then we use the usual 
shorthand: P + N — {P + x\ xE N}. In particular, straight hues are one-dimensional 
affine subspaces P + L{x), where L{x) denotes the one-dimensional vector subspace spanned 
by the vector x. Ordered vector bases in M will be denoted by (60,61,62,63). See Fig. 2 for 
a graphic representation (here, as in the following, one space dimension is omitted). 

III. CAUSAL STRUCTURE AND INERTIAL MOTIONS 

Of course, the affine space structure is still a very poor one, one needs further specification. 
The most obvious element needed is a one introducing the differentiation between physical 
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time and space directions. This is achieved in the following way. 

We shall say that the spacetime is equipped with the causal structure if in the accompa- 
nying vector space one has distinguished the following set (see Fig. 3): 

GS: a three-dimensional subspace S C M, 

SR: a homogeneous vector quadric V G M (different from a subspace), with respect to 
which three dimensions of M are on equal footing, but not the fourth. 

By a homogeneous vector quadric we mean here a set of vectors x E M whose coordinates 
x^,x^,x'^,x^ in some (and then any) basis satisfy the equation Ylt j =0*^13 — with some 
basis-dependent numerical coefficients aij. We recall that for any such quadric there is 
a basis in which it takes one of the forms eo{x^y + ei{x^y + £2(2;^)^ + £3(2;^)^ = 0, where 
En — 0, ±1 (uncorrelated values). The only possibility (up to a permutation of the basis 
vectors) to satisfy the demand imposed above on V is that in this canonical basis F is a cone 
given by: 

xeV ^ {xy - {x'Y - {x^ - {x^y = . (1) 

We shall say that a vector x lies inside (or outside) V if (x°)^ — {x^Y — {x'^Y — {x^Y > 
(< 0) respectively. 

We say that a nonzero vector is a causal vector if it: 

GS: does not he in S, 

SR: lies inside or on V. 

In addition we introduce the notion of a timelike vector which 

GS: is identical with a causal vector, 

SR: lies inside V. 

We shall say that two events P and Q are causally related if PQ is a causal vector, and 
they are temporally related if it is a timelike vector. 

The causal structure makes contact with physics by the following identification. An 
inertial motion is a straight fine in spacetime A4 with a timehke directional vector (thus 
any two events on this line are temporally related). Such lines will be called world-lines of 
the motion (see Fig. 4) 
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If a point Q P is not causally related to P we say that it lies elsewhere with respect 
to P. One then cannot reach Q from P by an inertial motion. 

IV. THE FOUR ORIENTATIONS OF THE SPACETIME 

Let us choose a basis of M in which 

GS: the subspace S is given by x° = 0, 

SR: the cone V has the canonical form. 

The set of causal vectors splits into two disjoint sets: those for which a;° > or < 
respectively in the distinguished basis. We denote one of these sets by C+ and call it the 
future and the other by C_ and call it the past. (After this choice has been done we can 
adjust the sign of x° so that > ior x E C+.) Then the future (past) of any event P is 
the set P + C+ (P + C_), and Q is in the future of P if, and only if, P is in the past of Q. 
Let us write Q > -P for "Q is in the future of P", and Q > P for: Q > P ov Q = P. Then 
the relation Q > P defines a partial order in M.: 

r p>p, 

2° if Q > P and P > g then Q = P, 

3° if P > Q and Q > P then R>P. 

The only less obvious of these properties is the third one in the special relativity case. To 
prove it observe that a; G C+ if in a canonical basis < > Ay(x^7^~+7a?^P~+aJ^. If y is 
another such vector then it is easily seen that the same relation is satisfied with x replaced 
by x + y, which was to be proved. See Fig. 5 for a graphic representation of causally defined 
regions. 

As there are two possible choices for the identification of the sets C± we say that there 
are two possible causal orientations of the spacetimc A4. 

At the same time M as a real vector space has two possible orientations defined as usually 
as the equivalence classes of bases. In combination with the causal orientation this gives 
four choices of the spacetime M. orientations. 
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V. RELATIVE REST, INERTIAL OBSERVERS, INERTIAL FRAMES 



We do not have yet any metric tools, so we are unable to determine relative velocity of 
inertial motions, but we can already say what it means that two motions are in relative rest: 
their world-lines are parallel (i.e. have common directional vectors). 

We decide that there is no need to differentiate between an inertial motion and an often 
used term of inertial observer; the difference, if any, is a rather psychological one. 

Finally, by an inertial frame we mean the class of all inertial observers remaining in 
relative rest to each other. We do not see the need to make this notion more specific, as 
is often assumed, by demanding that a particular basis has been chosen with the timelike 
vector along the world line of the motions in this family. 

VI. METRIC STRUCTURE, FOUR- VELOCITIES 

We recall two facts from linear algebra: 

1° The kernel (zero space) of a nonzero linear form on a vector space is a subspace 
of codimension one. Conversely, any such subspace S determines uniquely up to 
a constant factor a linear form Dt such that 

xeS ^ Dt{x) = 0. (2) 

2° A real vector quadric V (if different from a subspace) determines uniquely up to 
a constant factor a symmetric metric g such that 

xeV g{x,x) = Q. (3) 

A proof of the second fact for the case of our cone V is given for completeness in the 
Appendix. 

A. Galilean spacetime 

In the case of the Galilean spacetime we chose the sign of Dt by demanding that 

Dt(x) > for xeC+. (4) 
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Then Dt{PQ) > if Q lies in the future of P. The remaining positive factor in the definition 
of Dt is fixed arbitrarily. For an arbitrarily chosen point Pq we fix a real value t{Po). Then 
there is a unique affinc form taking this value at Pq and having Dt as its linear part. This 
means that for each pair of points P, Q there is 

t{Q) = t{P) + Dt(PQ) . (5) 

This form determines the universal time in the Galilean spacetimc. The metric structure of 
this spacetime is now completed by choosing a Euclidean metric h on the subspace S. This 
metric then determines 'spatial' metric relations on each hyperplane Q + S of constant time. 
One notes that there are no relations of this kind between points on different constant time 
planes. Note also that the relative scale of the metric tools Dt and h is arbitrary. See Fig. 6 
for graphic representation of the metric structure of GS. 

The world-lines of inertial motions pierce precisely at one point each of the constant time 
hyperplanes. For each family of parallel inertial motions there is a unique directional vector 
u for which Dt{u) — 1. We shall call such vector a unit timelike, future-pointing vector or 
the four-velocity of these world-lines. 

Having chosen a particular family of inertial parallel motions characterized by the four- 
velocity u one can split the vector space into time and space parts by 

M = L{u) ®S, (6) 

where L{u) denotes the onc-dimcnsional subspace spanned by u. Observers in the chosen 
family decompose each vector x into the time and space parts by 

X — Dt{x)u -\- Xu , so Xu & S . (7) 

Note that while Dt{x) does not depend on the space part Xu does depend on this vector, 
that is to say on the family of parallel inertial motions. The Euclidean scalar product h can 
be applied to the space parts of any two vectors x and y and we shall also write 

h{^u, Vu) ^Xuoyu. (8) 

B. Special relativity 

In this case g is fixed up to a real factor by the cone V, as described above. We choose 
its sign by the convention that in the canonical basis of V the metric has the signature 
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(+1,-1,— 1,-1). The remaining positive factor is chosen arbitrarily. The metric structure 
of the spacetime is determined completely by g. The vector x is a timelike vector when 
g{x,x) > 0, and it is a causal vector when it is nonzero and g{x,x) > 0. In addition we say 
that a vector is spacelike if g{x,x) < 0. We shall also use the notation 

g{x,y) = x-y, x ■ x = x^ . (9) 

See Fig. 7 for the metric properties of vector types. 

If Q lies in the future of P then there is a unique inertial motion joining them. The 
proper time interval covered by this motion from P to Q is determined by 

^r{P,Q)=[g(PQ,PQ)\"\ (10) 

Let u = XPQ with A > so that u G C+. If we demand that g{u,u) = 1 then u is fixed 
uniquely by these conditions and A = [g{PQ, PQ)] We call such u a unit timelike, 
future- pointing vector or a four-velocity. 

A four-velocity u may be used to define a time variable correlated with the inertial frame 
defined by u. As in the Galilean case we fix t„(Po) and then there is a unique affine form t„ 
taking this value at Pq and having the linear form 

Dtuix)=u-x (11) 

as its linear part. This means that for each pair of points P, Q there is 

tu{Q)=tu{P) + Dtu{PQ). (12) 

Note that if P and Q he on one w-world-line, Q in the future of P, then 

DtuiPQ) = Ar(P, Q) (13) 



so the definition of Dtu is an extension of the proper time interval on a m- world-fine, Eq. (10). 

Let us denote by Su the kernel of the form Dt„, which is the subspace of vectors orthogonal 
to u with respect to the metric g. Then the hyperplanes P -\- Su are the sheets of constant t„ 
time. The metric g when restricted to Su reduces to — where is a Euclidean metric. 
Thus the objects Dtu, tu, Su and hu play a similar role as Dt, t, S and h in the Galilean 
case, but with several important differences: 
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1° Here these quantities are not universal as in the Gahlean case, they are functions of 
the vector u; thus they depend on the choice of a family of inertial observers in relative 
rest. 

2° This relative character implies weaker status of these quantities as compared to the 
Gahlean case. 

3° On the other hand the form Dtu and the metric hu are uniquely determined by g, so 
their relative scale is unambiguous. This is to be contrasted with the Galilean case, 
where the scale of Dt and h could be fixed independently. 

The decomposition of the vector space M into time and space parts takes now the form 

M = L{u) ®Su, X = Dtu{x)u + Xu, XueSu, (14) 

see Fig. 8. Note that in this case both Dtu{x) and Xu depend on u, and for different choices 
of this four-velocity the space parts x^ lie in different subspaces. For Xu,yu £ Su we shall 
write XuOfju — • Uu ^-nd also denote \xu\ — ^JXy, o Xu- Then 

X -y ^ Dt^{x)Dtu{y) - Xuoy^, x'^ ^ {u ■ xf - \xy\^ . (15) 

The scalar product, in contrast to the Gahlean case, is apphcable to any vectors. See Fig. 8 
and 9 for a graphic representation of decompositions and four-velocities, and Fig. 10 for the 
dependence of Su on u. 

VII. EQUIVALENCE OF OBSERVERS, LIGHT SIGNALS AND THEIR SPEED 

The principle of relativity, i.e. of the equivalence of observers, can be now put in the 
following form: 

1° Physical theories do not depend on the choice of the inertial frame, i.e. of the four- 
velocity u determining all inertial motions in a given family. 

2° The set of physical states conforming with physical theories does not distinguish any 
of the inertial frames. 

In particular: 



9 



1° In SR the Maxwell equations imply that the light signals propagate along straight lines 
whose directional vectors lie on V, i.e. I is such a vector iff g{l, I) — 0. These vectors are 
called therefore lightlike vectors and V is called the light- cone. The Maxwell equations 
do not conform to the principle of relativity in the GS case. In this case the only way 
to avoid clash with the principle of relativity is to assume that light propagates with 
infinite speed, i.e. the directional vectors of hght rays he in S. 

2° If one defines physical units of time and space in each inertial frame with the use of 
analogous physical phenomena then the proportion of these units to the geometrical 
units defined by Dt and h in the case of Galilean spacetime, and g in the case of SR, 
is the same for all observers. 

3° In the SR case if / is lightlike and u is any four-velocity, then \Dtu{l) \ = \lu\ - light 
covers in each inertial frame a unit distance in a unit time in geometrical units. If one 
determines physical units as in the preceding point their ratio gives the speed of light 
in all inertial frames in those units. 

Note that the geometrical objects of the spacetime include beside metrical tools also the 
choice of one of the four orientations (as defined above). The principle of relativity in the 
above form does not require the independence of physics of this choice. As is well-known 
there are exceptions not conforming to this extended demand. 

VIII. RELATIVE VELOCITIES AND THEIR COMPOSITION 

To be precise the term 'four-velocity', although deeply rooted in the language usually 
used in SR, is somewhat misleading. In fact the vector u of an inertial frame simply points 
in the direction in which time flows but there is no space translation for all observers in this 
frame. To introduce a more justified notion of velocity one needs a reference observer which 
'rests'. But 'all observers are equal', so one has to say with respect to which of them one 
makes the measurement. 

Thus we assume there are given two four-velocities u and u' and we want to determine 
a velocity of the motion defined by u' with respect to that defined by u. We propose three 
candidates: 

1° A{u',u) = u' -u, 
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2° Vpr{u',u) = <, 

3° v{u',u) = u'jDtu{u'). 
The r.h.s. in 2° is formed as in ([7| and (14) and the subscript 'pr' stands for 'proper'. In 3° 



Dtu is independent of u in the Galilean case. 

The first of these definitions satisfies the antisymmetry and chain properties: 

A(m» = -A(m,m'), A{u" ,u) = A{u" ,u') + /\{u' ,u) , (16) 

which has obvious interpretational advantages. 

A. Galilean spacetime 

In this case Dt{u') = 1 and u'^ = u' — Dt{u')u = u' — u, so all three definitions coincide 
and we shall use notation v{u',u) for this quantity (see Fig. 11). We have v{u',u) G S and 
point 3° above tells us that this vector gives the change of position of an observer with four- 
velocity u' with respect to one with four velocity u, undergone in unit time. The composition 



of velocities obeys simple vector addition law (16) (see Fig. 12 



B. Special relativity 

In this case all three definitions are different (see Fig. 13). The first one has the advantage 
of the vector addition composition law ([T6| (see Fig. 14), but A{u',u) does not lie in any of 
the subspaces Su or Su'- Rather, it is in the subspace of the observer with four-velocity 
'half way' between u and u': w = {u + u')/ \/{u + u') ■ [u + u'). 

The second and the third definitions give parallel vectors in S^- The proper velocity 
v^^{u\u) is the displacement of the motion along any world-fine P + L{u'), as seen in the 
M-frame, undergone during unit time interval as measured on the world-fine (proper time) 
(see Fig. 15). The velocity v{u\u) is a similar displacement but scaled to unit time in 
ff-frame. It is only this latter quantity which is bounded by 1 (light velocity as defined in 



Section VII). 



The explicit form of the two latter velocities is easily obtained: 

Vpr{u' ,u) = U — U ■ UU , (17) 



U 



v(u' , u) = u . (18) 

u' ■ u 
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Neither of these velocities satisfies the antisymmetry or the chain rule properties (16). If we 
write the first of these equations in the form u' = u' ■ uu + v^t- and take the scalar square of 
both sides we find 

{U ■ Uf - \v^r? = I (19) 

(from now on we write = v^^iv! ,u)^ v = v{u' ,u)). This tells us that the quantities u' ■ u 
and |fpr| may be represented as the hyperbohc cosine and hyperbohc sine of some unique 
parameter > 0. If we denote k = expip > 1 we get the representation 

u' ■u = l{k + k-^) = c{k), \vpr\ = l{k - k'^) = s{k) , \v\ = ^. (20) 

Some other useful relations which follow are 

1 



c{k) = ^/l + \v,,\^ = ^===, s{k) = ^AA=, (21) 



\v 



1 + \v\ 



1/2 



/c = |vl + Vl + lvP= (^Y^J • ^^^^ 
We shall find the direct physical interpretation of k in the next section. 

The magnitude of k is invariant with respect to the interchange of u and u', so if we 
denote v'p^ = Vpr{u,u') and v' = v{u,u') then we have 

l^prl = Ivl ' W\ = 1^1 • (23) 

The motion of an observer with respect to the w-frame is often defined rather in terms of 
Vpr or V than u', or similarly with the role of observers interchanged, and then 

u = c{k)u + fpr = c{k){u + v) = c{k)u + s{k)n , 

(24) 

u = c{k)u' + v'p^ = c{k){u' + v') = c{k)u' + s{k)n' , 

where by n and n' we have denoted the unit spacelike vectors pointing in the direction of 
V and v' respectively. Although the use of fpr or v instead of u' may seem better suited for 
the point of view of the w-frame, one has to be careful not to project Galilean properties of 
velocities to SR. For instance, we have v' ^ — f , in contrast to GS. 

The composition of velocities of these types is rather complicated and not very illumi- 
nating. The special case of four-velocities m, m', m" lying in one two-dimensional subspace 
will be discussed in the next section. 
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IX. TIME MEASUREMENT 



The problem one wants to address here is the foUowing. Two events P and Q on a world 
line with four velocity u' are separated by the vector At'u', so the time interval between 
them as measured directly by the inertial observer on this world-hne is At'. What time-span 
At win be measured between these events in the frame defined by the four-velocity m? 



A. Galilean spacetime 

Here the answer is simple. The spacetime is equipped with the universal time interval 
form Dt, so there is no doubt how to measure this interval in any frame. One has 

At = Dt{At'u) = At' . (25) 



B. Special relativity 



If one employs the frame-dependent time interval form Dtu described in Section VI B 
one finds 

At = Dtu{At'u') =u-u' At' = c{k)At' , (26) 

(notation as in the preceding section). This gives the famous 'time dilation' effect. However, 
one should be careful to interpret this result properly. No inertial observer from the w-frame 
can pass directly both events P and Q, thus the measurement in this frame is by necessity 
indirect. Observers on the world-hnes P + L{u) and Q + L{u) to estabhsh one frame- 
dependent time variable t^ need only to agree on a choice of a constant time hypersurface to 
synchronize their clocks (as the time-interval form Dtu is known directly to both of them). 
After this has been settled (see below) the time t„(P) is measured directly by the first 
observer, and the time tu{Q) is measured directly by the other. The difference tu{Q) ~tu{P) 
gives At. See Fig. 16. 

The synchronization of clocks can be done by the radar method. The first observer sends 
at his time ti a light signal towards the other one and receives it back refiected at ^2- Denote 
by X the event on the world-hne of the first observer at his time (ti + t2)/2, and by Y 
the event on the world-line of the second observer at which the reflection of the light ray 
takes place, see Fig. 17. If h and I2 are lightlike vectors as depicted in the figure, then 
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(^2 — ti)u — li + I2, XY — (li — h)/'^, so u ■ XY — 0. Thus X and Y lie in one hyperplane 
of ^-simultaneity and if the second observer agrees to set his clock for {ti + 12)/2 at Y, the 
clocks will be synchronized. 

In real life the time dilation measurement is rarely, if at all, done this way. Probably 
the most famous instance of the dilation effect is the decay of muons produced by cosmic 
radiation coming to Earth. Muons are unstable particles with a characteristic lifetime (in 
their rest-frames). They are produced with known energy (so also known velocity) by scat- 
tered cosmic rays. One finds that their mean lifetime in the Earth-frame is much longer 
than the characteristic one. However, what is directly measured is not any time at all! One 
measures the distance they cover during their life; then knowing their relative velocity in 
the Earth-frame one calculates their lifetime in this frame. 

Another type of time measurement is by registering the time of arrival of light signals. 
Suppose that two inertial observers travel along world-lines P + L{u') and P + L{u) respec- 
tively (thus we assume for simplicity that they meet at P). Let both of them set their clocks 
so as to show at P. The u'-observer sends a light signal towards the ^-observer at his time 
t' , which arrives at the w-obscrvcr's world-line at the time t+ on that line. Thus one has the 
equation t'u' -\- 1 — t+u, where I is the lightlike, future-pointing vector connecting these two 
events (see Fig. 18). We write this as 

l^t+u- t'u' , I -1^0, l-u>0. (27) 

Solving the second equation for t+ one obtains two values out of which the third condition 
selects only one: 

t+ =u-u't' + ^J{u■u'y -l\t'\ = c{k)t' + s{k)\t'\ . (28) 

Note that t', t+ < for observers approaching each other (parts of world-lines causally 
preceding P) and t', t+ > for observers moving away from each other (parts of world-lines 
causally following P). Let now the w'-observer send two signals at times t\ and t'2 > t'^, 
either both negative or both positive, and denote Ai' — t'2 — t'^, Ai+ — t+2 — t+i- Then one 
finds from the above relation that 

At_i_ = k^^At' observers moving towards each other, 

(29) 

At^ — kAt' observers moving away from each other . 
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Note that the result is completely different from the 'dilation effect'. 

The above connections have a directly observable physical consequence. The light is a 
wave phenomenon; the change of its phase from one ray to another is the same for each of 
the above observers. But the times corresponding to the given change of phase, say 2tc, are 
related as above. Thus the frequencies of light and u for the two observers are related by 
u = ku' observers moving towards each other , 

(30) 

V = k v' observers moving away from each other . 



With the interpretation of /c-coefiicient given by the second equation in (29) we can now 
find a simple formula for the composition of velocities (or rather their lengths) in the special 
case of three co-planar four-velocities m, m', m". Let the fc-coefiicients be denoted as in 



Fig. 19. This figure then also shows that K = kk' . Using the last equation in (20) and 



Eq. (l22j) one finds 

I / // N| \v{u',u)\ + \v{u'\u')\ . 

\V(U ,U)\ = — i-. 31) 

We end this section with a warning against a popular error in graphical representations 
of the time dilation found in many introductory texts on SR. One of many variants is this: 
an individual A is speeding in a rocket towards (or away from) another individual B, who 
is busy with some activity. Each of the individuals is equipped with a clock and A watches 
(by 'looking') B's activity. The claim then is that A will measure B's activity to last longer 
then it lasts for B in agreement with the time dilation formula. This, however, is wrong; in 
fact A receives light signals from B, so his measurement will give a result obeying one of the 



cases in Eqs. (29). In fact, for approaching observers, the time in question is shorter. 



X. SPACE MEASUREMENT 



Here we pose the following question. Two parallel world-lines with four-velocity u' are 
separated by a vector z' which is a 'pure space' vector in the w'-frame. What is the 'pure 
space' vector z which separates them in the frame defined by u? These two vectors may 
be thought of as connecting two particles in a rigid body in these two frames. This latter 
notion has limitations in SR: it runs into difficulty when accelerations are involved, and then 
needs an input of dynamics to be modified. However, as long as only inertial motions are 
involved, a rigid body may be identified with some family of parallel world-lines. This body 
rests in the frame defined by these world-lines. 
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A. Galilean spacetime 



Here again the answer is simple: the 'pure space' directions are universally determined 

by S, so 

z = z' eS. (32) 

B. Special relativity 

In this case 'pure space' means that u' ■ z' = u ■ z = 0. The condition for z to connect 
the same two world-lines is z = z' + Xu' with some real A. Taking the scalar product of this 
equation with u we find this coefficient and obtain 

z = z u' . (33) 

u' ■ u 

These two vectors can be decomposed as 

z = z'^ + a'n' , z = z± + an , (34) 



where z'^ is orthogonal to u' and n' (as defined at the end of Section IX), zj^ is orthogonal to 



u and n, and a, a' are numerical constants. Note that z'^ and z^ are equivalently identified 



as parts of z' and z orthogonal both to u and u' . Taking the scalar product of Eq. (33) with 



u' we find z ■ u' = —z' ■ u/u' ■ u. Using now Eqs. (24) and (34) we find after some simple 
algebra 



a' 



z^ = z'^, a = -^. (35) 

The second of these equations describes the effect of the so called 'length contraction', 
whose popular formulation could run as: 'the dimensions parallel to the relative velocity 
measured by the moving observer are by the factor 1/c{k) shorter then those measured by 
the observer in rest with respect to the object being measured'. However, one should note 
that this formulation and the term 'contraction' are somewhat misleading: 

1° The vectors z' and z connect two different pairs of events on the two world-lines 
considered, nothing is being 'contracted'. Events separated by z' are simultaneous in 
the rest frame of the 'rigid body', while those separated by z are simultaneous for the 
moving observer. 
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2° The vectors n' and n (pointing in the directions of the two respective velocities) are 
not even parallel, so for each of the frames the term 'parallel to the velocity' means 
something different. 

Figure 20 illustrates the situation for the special case -z^ = ^± = 0, which means that for 
the M-observer the rigid rod with ends on the two world-lines moves parallelly to its axis. 

The proper understanding of the above dismisses various 'length contraction paradoxes' 
in SR.I^ The key to all of them is a cautious analysis of the relation between various vectors 
involved in the problem. 

We illustrate this with a geometrical situation whose variants lie at the base of most of 
these effects. Suppose we have two pairs of parallel world-lines: P + L{u'), Q + L{u'), and 
P + L{u), Q + L{u), so that the first lines in these pairs intersect at P, and the second lines 
intersect at Q. Physically this may be thought of as modeling two rigid rods in relative 
motion, the ends of the first and the second rod described by the lines in the first and in the 
second pair respectively. The 'front' ends of the rods meet at some point and similarly the 
'back' ends meet at some other point. Let z' and w be the 'pure space' vectors (in respective 
rest-frames) connecting the ends of rods and denote x = PQ. (See Fig. 21. The picture 
might suggest that the rods are bound to clash and cannot 'go through'. This is because we 
lack in the picture the fourth dimension, which may be used to shghtly detach the rods.) 
Then one has 

X = z' + jji'u = w + vu (36) 



with some constants jJ , v. We decompose z' as in the first Eq. (34) and similarly write 



w = I5n , w = ^ n' , (37) 

cik) 



(the second formula obtained in analogy with Eqs. (34) and (35) is written down for later 



use). As n and n! can be expressed as hnear combinations of u and v! (see Eq. (24)), the 



consistency condition for the second equation in (36) is 



A = w^, (38) 

and then the constants /i' and v have unique solutions, which we do not need to write down 
explicitly. 

The geometry of the situation is clear and no interpretational difficulty arises if one insists 
on this four-dimensional picture. However, if one uses the 'length contraction' language 
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'paradoxes' easily arise. Suppose, for instance, that the vector x is spacelike (as in Fig. 21) 
and consider any four-velocity orthogonal to x. Then the intersecting of lines has this 
interpretation: in each of these frames the two rods pass each other parallelly, with both 
respective ends simultaneously coming into contact. But now the 'paradoxical' problem 
arises: if we go to some other frame not in this family, then due to different velocities of the 
two rods they will change their size in different way, so the ends cannot meet. The simple 
explanation is, of course, that what is simultaneous in one frame usually is not simultaneous 
in another, which falsifies the above conclusion. And even more, the rods moving parallelly 
in one frame usually do not remain parallel in another. 

To illustrate the last point suppose that in the above geometrical setting x = w, i.e. 
the rods are parallel and of equal length in the w-frame. This means that w = z, and 



decomposing these vectors as before we find a' = —c{k)(3. Using this and Eq. (38) we find 

w' = w± ^ n , z = w_L — c(A;)/9 n . (39) 

cik) 

These vectors are parallel if, and only if ti;_L = or /5 = 0. In all other cases rods move in 
the w'-frame askew to each other. This is illustrated in Fig. 22. 

XI. NON-INERTIAL MOTIONS, PROPER TIME, SIMULTANEITY 

Inertial motions, as we have seen, have a special role to play for the interpretation of the 
geometry of spacetime. However, the picture would not be complete without mentioning 
other, non-inertial, motions. Straight lines are special examples in the more general class of 
curves. A regular curve may be defined as a set of points obtained as values of a differentiable 
mapping A t— -P(A), where A is a real parameter taking values in some (finite or not) interval 
on the real axis. The curve is invariant under a change of parameter A = /(A'), where / 
is differentiable together with its inverse. Each regular curve has at each its point -P(A) 
a tangent vector defined as dP{X)/d\. The extension of tangent vectors changes with the 
change of parameter (but the tangent straight lines they generate remain unchanged). 

We now define a general world-line as a curve with a four-velocity as its tangent vector 
at each its point. We say that r is a proper time of a world-line if it has the form t ^ P{t) 
and the equation 

= <r) (40) 
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defines at each point the tangent four- velocity u{t). Physically proper time intervals are 
measured by clocks traveling along the world-line. Integrating the above equation one ob- 
tains 



T2 



P1P2 = / u{t) dr , where Pi = P{n) . (41) 



Tl 



Note that sums of four-velocities are future-pointing timelike vectors, so P2 is in the future 
of Pi. One introduces also the concept of the four- acceleration: 

Note that acceleration, hke relative velocity, points in a 'purely spatial' direction: 

GS: Dt{a{T)) = ^Dt{u{T)) = , 

dr (43) 



SR: Dt^(^^){aiT)) = iz(r) ■ a(r) = = 0. 



However, unlike relative velocity, the acceleration is absolute - it does not need a reference 
observer. 

We now want to find 

1° what is the relation of the proper time to affine time functions defined earlier, 
2° does the presence of acceleration influence the concept of simultaneity? 

A. Galilean spacetime 



We apply the hnear form Dt to both sides of Eq. (41 ) and find 



t{P2)-t{Pr)=Dt{l\P2)= r Dt{u{r))dT = T2-n. (44) 

Thus the proper time intervals are identical with the absolute time intervals. Also, the 
notion of simultaneity is in no way infiuenced by accelerations. 



B. Special relativity 

Here we take the form Dtu and then proceed as in the Galflean case to find 

tu{P2) - tu{Pl) = r U- u{t) dT>T2-n. (45) 
Jti 
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Therefore the proper time interval is always smaller than any aflfine time function interval, 
except for the case when u{t) = u. The latter case gives simply P(r) = P{ti) + (r — ti)u, 
which is an inertial motion; proper time intervals are then equal to the w-inertial time 
intervals on that line. In general this is not the case. However, put ti = T2 = r + dr and 
u = u{t). Then we find 

tu{r){P{r + dr)) - t^ir){P{r)) = dr , (46) 

so locally the proper time interval is equal to the time interval as defined earlier for inertial 
motions. 

With accelerated motions in play it is now possible to let two general observers start 
from Pi, take different routes, and then meet again at P2. In general their clocks will show 
different time intervals between these two events. In particular, let the first observer go 
straight from Pi to P2 along an inertial world-line, and let u be his four-velocity. Then his 
clock will show the interval tu{P2) — tu{Pi)-, which is always more than the reading of the 
proper time interval for any accelerated observer. There is no paradox here (the famous 
'twin paradox') - the accelerations, as noted above, are absolute, so there is no symmetry 
between the observers. 

Consider now simultaneity. Suppose that for an observer on the world-line P(t) we can 
extend this notion in the way determined by his local position and four-velocity: event X is 



from his point of view simultaneous with the event P(t) iff P(r)X • u{t) = 0. However, this 
leads to conceptual difficulties. To see this suppose the observer crosses Pi with four-velocity 
ui and then P2 with four-velocity U2- The two corresponding simultaneity hyperplanes cross 
on the 2-plane of events X determined by the linear system 



PiX ■Ui = 0, i = l,2. (47) 



Take any event X on this 2-plane and put X- = X + P^X. We have PjX- = 2PjX, so X[ 



is simultaneous with Pj. At the same time there is X[X'2 = — P1P2. Therefore X'2 is in the 
past of X[. Thus an event which according to the above definition is simultaneous with Pi 
turns out to be in the future of an event simultaneous with a later event P2 (see Fig. 23). 

This difficulty should by no means be interpreted as an argument against the objectivity 
of the 'direction of time flow'. This latter notion should be simply identified with the choice 



of the causal orientation and the emerging partial order Q > P, as discussed in Section [IV 
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The difficulty rather points to the weakness of the notion of simultaneity, its restricted 
applicability and, to some degree, its conventional character. It also shows that the strict 
'dilation' and 'contraction' problems are of rather academic nature. 

XII. FOUR-MOMENTUM, FOUR-ANGULAR MOMENTUM AND THEIR CON- 
SERVATION 

The four-momentum of a particle with mass mi and four- velocity Ui is given by 

Pi = TTliUi . (48) 

If one chooses a reference point O and Xi is a vector from this point to the position of the 
particle then the four-momentum tensor is defined by 

Li = 2xi A pi . (49) 

Let pi, . . . ,pkhe the initial and p'l, ■ ■ - p'l the final four-momenta in a conservative mechanical 
process. The invariant laws of momentum and angular momentum conservation say 

k I k I 

^-E^^;-' E^^ = E^;- (50) 

1=1 j=l i=l j=l 

A. Galilean spacetime 

Here the mass is an invariant of the four-momentum given by rui = Dt{pi). The decom- 
position of the four-momentum with respect to the frame defined by the four-velocity u is 
thus 

Pi^miu + piu (51) 

see Fig. 24. We see thus that the law of conservation of mass and the law of conservation of 
momentum are aspects of one observer-invariant law of conservation of four-momentum. 

B. Special relativity 

The mass again is an invariant, but formed in another way: pi ■ pi — mf. Then in the 
u-frame we have 

Pl = Ely,U+Ply, , - \piy_f = Ulj , (52) 
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see Fig. 25. Eiu has the interpretation of the energy as seen in the chosen frame. Now the 
aspects of the observer-invariant law of conservation of four-momentum are laws of energy 
and momentum conservation, while the sum of masses needs not to be conserved. 
We observe that geometrical analogy is: 

Galilean mass <-> Einsteinian energy 

(and not energy energy). This analogy is further confirmed when one considers the time- 
space part of the conservation of four-angular momentum. For freely moving particles one 
obtains the law of uniform motion of center of mass in the Galilean case, and of center of 
energy in the SR case. 



XIII. GALILEAN KINETIC ENERGY 



The question then arises what is the geometrical status of the Galilean kinetic energy 
and does its conservation have an invariant character. 

To answer this observe that while there is no geometrical numerical invariant formed out 
of space-part of a single timelike vector, one can form a respective invariant for a pair of 
such vectors. Let Dt{pi) = nii, i = 1,2, and let u be any four-velocity. Then pi = rriiU + Piu, 
so that 

^-^ = ^-^e5. (53) 

mi m2 mi m2 



Thus the number 

2 



mim2 

d{Pl:P2) = 



Plu P2u 

mi m2 



> (54) 



2 

does not depend on u (see Fig. 26). For momenta pi, ■ ■ ■ ,Pk it is now easy to show, that 

k 

J2 d{Pi,Pj) = 2ME - |P„|' > , (55) 

i,j=l 

where 
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P-J2p^^ P-Mu + P,, E = J2^-^- (56) 

We learn two facts: 



. , . T 2mj 

1=1 1=1 



1° If the total four-momentum is conserved, then the condition of energy conservation is 
Galilean invariant. 
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2° There is always E > |P„p/2M, and the equality holds if, and only if, all momenta are 
parallel. 



XIV. CELESTIAL SPHERE 



We fix a reference point O and consider all light rays coming into this point. Imagine 
a world-line of an inertial observer with four-velocity u passes through this point. At this 
point the observer positions the space directions from which all light rays arrive. We want 
to find how the picture obtained in this way depends on the four- velocity u of the observer. 



A. Galilean spacetime 

Here we assume that the light rays propagate with infinite speed. Thus the straight lines 
of the rays lie in the hyperplane O + S, and their directional vectors are in S. But for such 
vectors the decomposition ([T]) is trivial and independent of u. Therefore the picture formed 
by fight on the celestial sphere is independent of the choice of particular observer crossing 
the point O. 



B. Special relativity 

A fight ray with the directional past-pointing vector —I & V comes from the space 
direction pointed by the unit spacefike vector 

where we have used the fact that |4p = — = (see Fig. 27). If u' is the four- 

velocity of another observer passing O and we denote for brevity r = r{l,u), r' = r{l,u') 
then we find 

u' ■ I 

J = {u — r) ■ u' = c{k) + s{k) nor. (58) 



Using this and Eq. (57) for r and r' we find the transformation r ^-^ r' of the celestial sphere 



of the M-observer to the sphere of the w'-observer: 



T — ?/ 

r' = u'+ . (59) 

c{k) + s[k) nor 



23 



Taking the scalar product of this equation with u we find, in particular, the well-known 
aberration formula: 

, , s(k) + c(k) nor ,„ . 

n'or' = - , 60 

c[K) + s[k) nor 

(the difference in signs is due to the direction of n and n'). 

A small variation of the direction of the light ray induces small variations Sr and Sr', which 
are tangent to the two respective celestial spheres. The linear transformation 6r t— > 6r' is 



found by varying Eq. (59): 



r / '^'^ noSr . 

"^^^ = 77\ ^ FTT^ 77\ [u - rj . (611 

c{k) + s{k)nor [c{k) + s{k) n o r]^ ^ ' ^ ' 

Taking now two different variations b\ and and using the constraints u ■ br = r ■ br = ^ 
we find 

b,r' o 5,r' = ^-^^ . (62) 

' ' [c{k) + s{k)nor]^ ^ ' 

This equation tells us that the linear transformation br h-* br' differs only by the factor 
[c(A;) + s{}i) nor\~^ from an isometric transformation. Thus locally (in the first order in br) 
the picture registered on the celestial sphere scales by this factor without a change of the 
shape (the angles).!^ 

Larger areas on the celestial sphere lose this scahng property and undergo more compli- 
cated transformations. However, one feature of the local transformation survives. To find 
it chose a spacehke vector < 0, and consider among vectors — / all those which satisfy 
the equation 

z-l = 0. (63) 

Using the geometrical quantities correlated to u the spacelike character of z is written down 
as {u ■ z)'^ < and the above condition on Vs takes the form 

r{l,u) o = — J — - = cos[(f){z,u)] , (64) 

I I I I 

where the last equality defines the angle (j){z,u). This equation tells us that the vectors 
r{l,u) are all those which form the angle (p{z,u) with the vector Zu/\zu\- Thus they form 
a circle on the celestial sphere. This fact is independent of the choice of a particular observer 
(its vector u) crossing the point O. However, the angle (l){z,u) does depend on this choice. 



Note in particular that if Eq. (63) determines a 'great circle' for the observer with four- 



velocity u (i.e. (t){z,u) = 7r/2), this circle will in general cease to be 'great' for the one with 
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the four-velocity u' . The exceptional cases when 'great' goes to 'great' are those determined 
by z orthogonal both to u and u' . 

To summarize, the picture obtained on the celestial sphere undergoes deformation from 
one observer to another, but in such a way that angles are conserved and circles become 
circles, although the 'greatness' property is usually not conserved. This is illustrated in 
Figs. 28 and 29. 
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XVI. APPENDIX 

Theorem. The cone V determines uniquely up to a constant factor a symmetric bilinear 
form g such that x eV <^=^ g{x,x) = 0. 

Proof. In a canonical basis V takes the form given in Eq. Q, which is equivalent to 
x° = ±-^^^=1(3^')^- If this imphes g{x,x) = Ylliu=o9f^i^^^^'^ ^ 0' ^^^^ conditions 

{gik + gooS^k) x'x" ± 2g^,^lYLJ^^ = 

must be satisfied identically (for any numbers x\ i = 1,2,3). Thus goi = 0, i = 1,2,3, and 
gik + gooSik = 0, i,k = 1, 2, 3. Therefore in this frame g{x, y) = goo - Ya^i ^'v') ■ ^ 
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Fig. 1. Reference frames - a popular picture. 
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Fig. 2. Vector and afiine space. 
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Fig. 3. Causal structure. 
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Fig. 4. Inertial motions. 
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past 



Fig. 5. Past, future, elswhere. 




PXoPY I PX|=[PXoPX]^ 



Fig. 6. Metric structure of GS. 
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XX > 




XX > 

Fig. 7. Scalar product in SR. 




Fig. 8. Metric structure of SR. 




Fig. 9. Four-velocities and future-directed lightvectors. 
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Fig. 10. Subspaces orthogonal to 4-velocities. 




t=0 




Fig. 16. Time measurement. 

t2 

(ti+t2)/2 ^ 

Fig. 17. Synchronization of clocks. 
At=kAt' / 




Fig. 18. Time of arrival of light signals. 
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Fig. 19. Composition of ^-coefficients for co-planar four-velocities: k/1 = K/k', so 

K = kk'. 
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Piu 



- |Piul^ = m^^ 
Fig. 25. Four-momentum in SR. 




P2/m2-Pi/mi = f^u/n^2-Piu/"^i 



Fig. 26. Galilean invariant of two causal vectors: \p2u/m2 — Piw/^^il 
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Fig. 27. Celestial sphere. 
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Fig. 28. A bicycle wheel in rest. 



Fig. 29. The same wheel as seen by a fast moving observer. 
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